Introduction

Exploration univariée de chaque variable du jeu de données. Exploration par rapport à la cible. Nombre de valeur manquante.

Il y a 79 variables explicatives.

Il y a 23 variables nominale.

Il y a 23 variables ordinale.

Il y a 14 variables discrète.

Il y a 20 variables continue.

(j’ai essayé de traduire le libellé des variables rapidement …)

Exploration de la cible

SalePrice

train[, summary(SalePrice)]
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   34900  130000  163000  180900  214000  755000

LogSalePrice

train[, summary(logSalePrice)]
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   10.46   11.78   12.00   12.02   12.27   13.53

Liste des variables et valeurs manquantes

Parfois certaines variables ont des NA’s “logique”, à voir s’il y a des valeurs manquantes “missing at random”. Supression des variables dans ce tableau n’ayant aucune valeurs manquantes dans les deux sets.

set variable NA’s
train PoolQC 1453
test PoolQC 1456
train MiscFeature 1406
test MiscFeature 1408
train Alley 1369
test Alley 1352
train Fence 1179
test Fence 1169
train SalePrice 0
test SalePrice 1459
train logSalePrice 0
test logSalePrice 1459
train FireplaceQu 690
test FireplaceQu 730
train LotFrontage 259
test LotFrontage 227
train GarageYrBlt 81
test GarageYrBlt 78
train GarageFinish 81
test GarageFinish 78
train GarageQual 81
test GarageQual 78
train GarageCond 81
test GarageCond 78
train GarageType 81
test GarageType 76
train BsmtCond 37
test BsmtCond 45
train BsmtExposure 38
test BsmtExposure 44
train BsmtQual 37
test BsmtQual 44
train BsmtFinType2 38
test BsmtFinType2 42
train BsmtFinType1 37
test BsmtFinType1 42
train MasVnrType 8
test MasVnrType 16
train MasVnrArea 8
test MasVnrArea 15
train MSZoning 0
test MSZoning 4
train Utilities 0
test Utilities 2
train BsmtFullBath 0
test BsmtFullBath 2
train BsmtHalfBath 0
test BsmtHalfBath 2
train Functional 0
test Functional 2
train Exterior1st 0
test Exterior1st 1
train Exterior2nd 0
test Exterior2nd 1
train BsmtFinSF1 0
test BsmtFinSF1 1
train BsmtFinSF2 0
test BsmtFinSF2 1
train BsmtUnfSF 0
test BsmtUnfSF 1
train TotalBsmtSF 0
test TotalBsmtSF 1
train Electrical 1
test Electrical 0
train KitchenQual 0
test KitchenQual 1
train GarageCars 0
test GarageCars 1
train GarageArea 0
test GarageArea 1
train SaleType 0
test SaleType 1

Exploration univariée

MSSubClass (Nominal)

Identifies the type of dwelling involved in the sale
[FR] Qualifie le type de logement de facon macro

Tableau

MSSubClass Libelle Eff. train Eff. test % train % test
120 1-STORY PUD (Planned Unit Development) - 1946 & NEWER 87 95 5.96 6.51
150 1-1/2 STORY PUD - ALL AGES 0 1 0.00 0.07
160 2-STORY PUD - 1946 & NEWER 63 65 4.32 4.46
180 PUD - MULTILEVEL - INCL SPLIT LEV/FOYER 10 7 0.68 0.48
190 2 FAMILY CONVERSION - ALL STYLES AND AGES 30 31 2.05 2.12
20 1-STORY 1946 & NEWER ALL STYLES 536 543 36.71 37.22
30 1-STORY 1945 & OLDER 69 70 4.73 4.80
40 1-STORY W/FINISHED ATTIC ALL AGES 4 2 0.27 0.14
45 1-1/2 STORY - UNFINISHED ALL AGES 12 6 0.82 0.41
50 1-1/2 STORY FINISHED ALL AGES 144 143 9.86 9.80
60 2-STORY 1946 & NEWER 299 276 20.48 18.92
70 2-STORY 1945 & OLDER 60 68 4.11 4.66
75 2-1/2 STORY ALL AGES 16 7 1.10 0.48
80 SPLIT OR MULTI-LEVEL 58 60 3.97 4.11
85 SPLIT FOYER 20 28 1.37 1.92
90 DUPLEX - ALL STYLES AND AGES 52 57 3.56 3.91
Total Total 1460 1459 99.99 100.01

Graphiques



MSZoning (Nominal)

Identifies the general zoning classification of the sale
[FR] Qualifie le type de zone du logement

Tableau

MSZoning Libelle Eff. train Eff. test % train % test
NA NA 0 4 0.00 0.27
C (all) Commercial 10 15 0.68 1.03
FV Floating Village Residential 65 74 4.45 5.07
RH Residential High Density 16 10 1.10 0.69
RL Residential Low Density 1151 1114 78.84 76.35
RM Residential Medium Density 218 242 14.93 16.59
Total Total 1460 1459 100.00 100.00

Graphiques



LotFrontage (Continuous)

Linear feet of street connected to property
[FR] Mesure en pied la distance commune entre la rue et la propriété

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 21 59 69 80 313 259
test 21 58 67 80 200 227

Graphiques

## `geom_smooth()` using method = 'gam'


LotArea (Continuous)

Lot size in square feet
[FR] Surface de la propriété en pied2

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 1300 7553.5 9478.5 11601.5 215245 0
test 1470 7391.0 9399.0 11517.5 56600 0

Graphiques

## `geom_smooth()` using method = 'gam'


Street (Nominal)

Type of road access to property
[FR] Type de la route pour accèder à la propriété

Tableau

Street Libelle Eff. train Eff. test % train % test
Grvl Gravel 6 6 0.41 0.41
Pave Paved 1454 1453 99.59 99.59
Total Total 1460 1459 100.00 100.00

Graphiques



Alley (Nominal)

Type of alley access to property
[FR] Type de l’allée pour acceder à la propriété

Tableau

Alley Libelle Eff. train Eff. test % train % test
NA No alley access 1369 1352 93.77 92.67
Grvl Gravel 50 70 3.42 4.80
Pave Paved 41 37 2.81 2.54
Total Total 1460 1459 100.00 100.01

Graphiques



LotShape (Ordinal)

General shape of property
[FR] Forme générale de la propriété

Tableau

LotShape Libelle Eff. train Eff. test % train % test
IR1 Slightly irregular 484 484 33.15 33.17
IR2 Moderately Irregular 41 35 2.81 2.40
IR3 Irregular 10 6 0.68 0.41
Reg Regular 925 934 63.36 64.02
Total Total 1460 1459 100.00 100.00

Graphiques



LandContour (Nominal)

Flatness of the property
[FR] Platitude du sol de la propriété

Tableau

LandContour Libelle Eff. train Eff. test % train % test
Bnk Banked - Quick and significant rise from street grade to building 63 54 4.32 3.70
HLS Hillside - Significant slope from side to side 50 70 3.42 4.80
Low Depression 36 24 2.47 1.64
Lvl Near Flat/Level 1311 1311 89.79 89.86
Total Total 1460 1459 100.00 100.00

Graphiques



Utilities (Ordinal)

Type of utilities available
[FR] Type de service publiques à disposition

Tableau

Utilities Libelle Eff. train Eff. test % train % test
NA NA 0 2 0.00 0.14
AllPub All public Utilities (E 1459 1457 99.93 99.86
NoSeWa Electricity and Gas Only 1 0 0.07 0.00
Total Total 1460 1459 100.00 100.00

Graphiques



LotConfig (Nominal)

Lot configuration
[FR] Configuration de la propriété

Tableau

LotConfig Libelle Eff. train Eff. test % train % test
Corner Corner lot 263 248 18.01 17.00
CulDSac Cul-de-sac 94 82 6.44 5.62
FR2 Frontage on 2 sides of property 47 38 3.22 2.60
FR3 Frontage on 3 sides of property 4 10 0.27 0.69
Inside Inside lot 1052 1081 72.05 74.09
Total Total 1460 1459 99.99 100.00

Graphiques



LandSlope (Ordinal)

Slope of property
[FR] Pente de la propriété

Tableau

LandSlope Libelle Eff. train Eff. test % train % test
Gtl Gentle slope 1382 1396 94.66 95.68
Mod Moderate slope 65 60 4.45 4.11
Sev Severe slope 13 3 0.89 0.21
Total Total 1460 1459 100.00 100.00

Graphiques



Neighborhood (Nominal)

Physical locations within Ames city limits (map available)
[FR] Localisation (quartier) de la propriété

Tableau

Neighborhood Libelle Eff. train Eff. test % train % test
Blmngtn Bloomington Heights 17 11 1.16 0.75
Blueste Bluestem 2 8 0.14 0.55
BrDale Briardale 16 14 1.10 0.96
BrkSide Brookside 58 50 3.97 3.43
ClearCr Clear Creek 28 16 1.92 1.10
CollgCr College Creek 150 117 10.27 8.02
Crawfor Crawford 51 52 3.49 3.56
Edwards Edwards 100 94 6.85 6.44
Gilbert Gilbert 79 86 5.41 5.89
IDOTRR Iowa DOT and Rail Road 37 56 2.53 3.84
MeadowV Meadow Village 17 20 1.16 1.37
Mitchel Mitchell 49 65 3.36 4.46
NAmes North Ames 225 218 15.41 14.94
NPkVill Northpark Villa 9 14 0.62 0.96
NWAmes Northwest Ames 73 58 5.00 3.98
NoRidge Northridge 41 30 2.81 2.06
NridgHt Northridge Heights 77 89 5.27 6.10
OldTown Old Town 113 126 7.74 8.64
SWISU South & West of Iowa State University 25 23 1.71 1.58
Sawyer Sawyer 74 77 5.07 5.28
SawyerW Sawyer West 59 66 4.04 4.52
Somerst Somerset 86 96 5.89 6.58
StoneBr Stone Brook 25 26 1.71 1.78
Timber Timberland 38 34 2.60 2.33
Veenker Veenker 11 13 0.75 0.89
Total Total 1460 1459 99.98 100.01

Graphiques



Condition1 (Nominal)

Proximity to various conditions
[FR] Proche de plusieurs type de condition

Tableau

Condition1 Libelle Eff. train Eff. test % train % test
Artery Adjacent to arterial street 48 44 3.29 3.02
Feedr Adjacent to feeder street 81 83 5.55 5.69
Norm Normal 1260 1251 86.30 85.74
PosA Adjacent to postive off-site feature 8 12 0.55 0.82
PosN Near positive off-site feature–park 19 20 1.30 1.37
RRAe Adjacent to East-West Railroad 11 17 0.75 1.17
RRAn Adjacent to North-South Railroad 26 24 1.78 1.64
RRNe Within 200’ of East-West Railroad 2 4 0.14 0.27
RRNn Within 200’ of North-South Railroad 5 4 0.34 0.27
Total Total 1460 1459 100.00 99.99

Graphiques



Condition2 (Nominal)

Proximity to various conditions (if more than one is present)
[FR] Proche de plusieurs type de condition

Tableau

Condition2 Libelle Eff. train Eff. test % train % test
Artery Adjacent to arterial street 2 3 0.14 0.21
Feedr Adjacent to feeder street 6 7 0.41 0.48
Norm Normal 1445 1444 98.97 98.97
PosA Adjacent to postive off-site feature 1 3 0.07 0.21
PosN Near positive off-site feature–park 2 2 0.14 0.14
RRAe Adjacent to East-West Railroad 1 0 0.07 0.00
RRAn Adjacent to North-South Railroad 1 0 0.07 0.00
RRNn Within 200’ of North-South Railroad 2 0 0.14 0.00
Total Total 1460 1459 100.01 100.01

Graphiques



BldgType (Nominal)

Type of dwelling
[FR] Type du batiment

Tableau

BldgType Libelle Eff. train Eff. test % train % test
1Fam Single-family Detached 1220 1205 83.56 82.59
2fmCon Two-family Conversion; originally built as one-family dwelling 31 31 2.12 2.12
Duplex Duplex 52 57 3.56 3.91
Twnhs Townhouse Inside Unit 43 53 2.95 3.63
TwnhsE Townhouse End Unit 114 113 7.81 7.75
Total Total 1460 1459 100.00 100.00

Graphiques



HouseStyle (Nominal)

Style of dwelling
[FR] Style du batiment

Tableau

HouseStyle Libelle Eff. train Eff. test % train % test
1.5Fin One and one-half story: 2nd level finished 154 160 10.55 10.97
1.5Unf One and one-half story: 2nd level unfinished 14 5 0.96 0.34
1Story One story 726 745 49.73 51.06
2.5Fin Two and one-half story: 2nd level finished 8 0 0.55 0.00
2.5Unf Two and one-half story: 2nd level unfinished 11 13 0.75 0.89
2Story Two story 445 427 30.48 29.27
SFoyer Split Foyer 37 46 2.53 3.15
SLvl Split Level 65 63 4.45 4.32
Total Total 1460 1459 100.00 100.00

Graphiques



OverallQual (Ordinal)

Rates the overall material and finish of the house
[FR] Note general du batiment et de la finition de la maison

Tableau

OverallQual Libelle Eff. train Eff. test % train % test
1 Very Poor 2 2 0.14 0.14
10 Very Excellent 18 13 1.23 0.89
2 Poor 3 10 0.21 0.69
3 Fair 20 20 1.37 1.37
4 Below Average 116 110 7.95 7.54
5 Average 397 428 27.19 29.34
6 Above Average 374 357 25.62 24.47
7 Good 319 281 21.85 19.26
8 Very Good 168 174 11.51 11.93
9 Excellent 43 64 2.95 4.39
Total Total 1460 1459 100.02 100.02

Graphiques



OverallCond (Ordinal)

Rates the overall condition of the house
[FR] Note general de l’emplacement de la maison

Tableau

OverallCond Libelle Eff. train Eff. test % train % test
1 Very Poor 1 6 0.07 0.41
2 Poor 5 5 0.34 0.34
3 Fair 25 25 1.71 1.71
4 Below Average 57 44 3.90 3.02
5 Average 821 824 56.23 56.48
6 Above Average 252 279 17.26 19.12
7 Good 205 185 14.04 12.68
8 Very Good 72 72 4.93 4.93
9 Excellent 22 19 1.51 1.30
Total Total 1460 1459 99.99 99.99

Graphiques



YearBuilt (Discrete)

Original construction date
[FR] Date de la construction

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 1872 1954 1973 2000 2010 0
test 1879 1953 1973 2001 2010 0

Graphiques

## `geom_smooth()` using method = 'gam'


YearRemodAdd (Discrete)

Remodel date (same as construction date if no remodeling or additions)
[FR] Date de modification (même que YearBuilt si aucune modification)

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 1950 1967 1994 2004 2010 0
test 1950 1963 1992 2004 2010 0

Graphiques

## `geom_smooth()` using method = 'gam'


RoofStyle (Nominal)

Type of roof
[FR] Type de toit

Tableau

RoofStyle Libelle Eff. train Eff. test % train % test
Flat Flat 13 7 0.89 0.48
Gable Gable 1141 1169 78.15 80.12
Gambrel Gabrel (Barn) 11 11 0.75 0.75
Hip Hip 286 265 19.59 18.16
Mansard Mansard 7 4 0.48 0.27
Shed Shed 2 3 0.14 0.21
Total Total 1460 1459 100.00 99.99

Graphiques



RoofMatl (Nominal)

Roof material
[FR] Materiaux de toiture

Tableau

RoofMatl Libelle Eff. train Eff. test % train % test
ClyTile Clay or Tile 1 0 0.07 0.00
CompShg Standard (Composite) Shingle 1434 1442 98.22 98.83
Membran Membrane 1 0 0.07 0.00
Metal Metal 1 0 0.07 0.00
Roll Roll 1 0 0.07 0.00
Tar&Grv Gravel & Tar 11 12 0.75 0.82
WdShake Wood Shakes 5 4 0.34 0.27
WdShngl Wood Shingles 6 1 0.41 0.07
Total Total 1460 1459 100.00 99.99

Graphiques



Exterior1st (Nominal)

Exterior covering on house
[FR] Revêtement facade

Tableau

Exterior1st Libelle Eff. train Eff. test % train % test
NA NA 0 1 0.00 0.07
AsbShng Asbestos Shingles 20 24 1.37 1.64
AsphShn Asphalt Shingles 1 1 0.07 0.07
BrkComm Brick Common 2 4 0.14 0.27
BrkFace Brick Face 50 37 3.42 2.54
CBlock Cinder Block 1 1 0.07 0.07
CemntBd Cement Board 61 65 4.18 4.46
HdBoard Hard Board 222 220 15.21 15.08
ImStucc Imitation Stucco 1 0 0.07 0.00
MetalSd Metal Siding 220 230 15.07 15.76
Plywood Plywood 108 113 7.40 7.75
Stone Stone 2 0 0.14 0.00
Stucco Stucco 25 18 1.71 1.23
VinylSd Vinyl Siding 515 510 35.27 34.96
Wd Sdng Wood Siding 206 205 14.11 14.05
WdShing Wood Shingles 26 30 1.78 2.06
Total Total 1460 1459 100.01 100.01

Graphiques



Exterior2nd (Nominal)

Exterior covering on house (if more than one material)
[FR] Revêtement facade (si plus d’un revêtement)

Tableau

Exterior2nd Libelle Eff. train Eff. test % train % test
NA NA 0 1 0.00 0.07
AsbShng Asbestos Shingles 20 18 1.37 1.23
AsphShn Asphalt Shingles 3 1 0.21 0.07
Brk Cmn Brick Common 7 15 0.48 1.03
BrkFace Brick Face 25 22 1.71 1.51
CBlock Cinder Block 1 2 0.07 0.14
CmentBd Cement Board 60 66 4.11 4.52
HdBoard Hard Board 207 199 14.18 13.64
ImStucc Imitation Stucco 10 5 0.68 0.34
MetalSd Metal Siding 214 233 14.66 15.97
Other Other 1 0 0.07 0.00
Plywood Plywood 142 128 9.73 8.77
Stone Stone 5 1 0.34 0.07
Stucco Stucco 26 21 1.78 1.44
VinylSd Vinyl Siding 504 510 34.52 34.96
Wd Sdng Wood Siding 197 194 13.49 13.30
Wd Shng Wood Shingles 38 43 2.60 2.95
Total Total 1460 1459 100.00 100.01

Graphiques



MasVnrType (Nominal)

Masonry veneer type
[FR] Type placage en maconnerie

Tableau

MasVnrType Libelle Eff. train Eff. test % train % test
NA NA 8 16 0.55 1.10
BrkCmn Brick Common 15 10 1.03 0.69
BrkFace Brick Face 445 434 30.48 29.75
None None 864 878 59.18 60.18
Stone Stone 128 121 8.77 8.29
Total Total 1460 1459 100.01 100.01

Graphiques



MasVnrArea (Continuous)

Masonry veneer area in square feet
[FR] Surface de placage en maconnerie en pied au carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 166 1600 8
test 0 0 0 164 1290 15

Graphiques

## `geom_smooth()` using method = 'gam'


ExterQual (Ordinal)

Evaluates the quality of the material on the exterior
[FR] Evalue la qualité des matériaux extérieures

Tableau

ExterQual Libelle Eff. train Eff. test % train % test
Ex Excellent 52 55 3.56 3.77
Fa Fair 14 21 0.96 1.44
Gd Good 488 491 33.42 33.65
TA Average/Typical 906 892 62.05 61.14
Total Total 1460 1459 99.99 100.00

Graphiques



ExterCond (Ordinal)

Evaluates the present condition of the material on the exterior
[FR] Evalue l’état réel des matériaux extérieurs

Tableau

ExterCond Libelle Eff. train Eff. test % train % test
Ex Excellent 3 9 0.21 0.62
Fa Fair 28 39 1.92 2.67
Gd Good 146 153 10.00 10.49
Po Poor 1 2 0.07 0.14
TA Average/Typical 1282 1256 87.81 86.09
Total Total 1460 1459 100.01 100.01

Graphiques



Foundation (Nominal)

Type of foundation
[FR] Type de fondation

Tableau

Foundation Libelle Eff. train Eff. test % train % test
BrkTil Brick & Tile 146 165 10.00 11.31
CBlock Cinder Block 634 601 43.42 41.19
PConc Poured Contrete 647 661 44.32 45.31
Slab Slab 24 25 1.64 1.71
Stone Stone 6 5 0.41 0.34
Wood Wood 3 2 0.21 0.14
Total Total 1460 1459 100.00 100.00

Graphiques



BsmtQual (Ordinal)

Evaluates the height of the basement
[FR] Evalue la taille du sous-sol (dans le sens socle de fondation)

Tableau

BsmtQual Libelle Eff. train Eff. test % train % test
NA No Basement 37 44 2.53 3.02
Ex Excellent (100+ inches) 121 137 8.29 9.39
Fa Fair (70-79 inches) 35 53 2.40 3.63
Gd Good (90-99 inches) 618 591 42.33 40.51
TA Typical (80-89 inches) 649 634 44.45 43.45
Total Total 1460 1459 100.00 100.00

Graphiques



BsmtCond (Ordinal)

Evaluates the general condition of the basement
[FR] Evalue les condition general du sous-sol (dans le sens socle de fondation)

Tableau

BsmtCond Libelle Eff. train Eff. test % train % test
NA No Basement 37 45 2.53 3.08
Fa Fair - dampness or some cracking or settling 45 59 3.08 4.04
Gd Good 65 57 4.45 3.91
Po Poor - Severe cracking 2 3 0.14 0.21
TA Typical - slight dampness allowed 1311 1295 89.79 88.76
Total Total 1460 1459 99.99 100.00

Graphiques



BsmtExposure (Ordinal)

Refers to walkout or garden level walls
[FR] Walkout = sous-sol à entrée directe

Tableau

BsmtExposure Libelle Eff. train Eff. test % train % test
NA No Basement 38 44 2.60 3.02
Av Average Exposure (split levels or foyers typically score average or above) 221 197 15.14 13.50
Gd Good Exposure 134 142 9.18 9.73
Mn Mimimum Exposure 114 125 7.81 8.57
No No Exposure 953 951 65.27 65.18
Total Total 1460 1459 100.00 100.00

Graphiques



BsmtFinType1 (Ordinal)

Rating of basement finished area
[FR] Note du sous-sol (est-ce vivable?)

Tableau

BsmtFinType1 Libelle Eff. train Eff. test % train % test
NA No Basement 37 42 2.53 2.88
ALQ Average Living Quarters 220 209 15.07 14.32
BLQ Below Average Living Quarters 148 121 10.14 8.29
GLQ Good Living Quarters 418 431 28.63 29.54
LwQ Low Quality 74 80 5.07 5.48
Rec Average Rec Room 133 155 9.11 10.62
Unf Unfinshed 430 421 29.45 28.86
Total Total 1460 1459 100.00 99.99

Graphiques



BsmtFinSF1 (Continuous)

Type 1 finished square feet
[FR] Surface finie (habitable?) du sous-sol de type 1

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 383.5 712.25 5644 0
test 0 0 350.5 753.50 4010 1

Graphiques

## `geom_smooth()` using method = 'gam'


BsmtFinType2 (Ordinal)

Rating of basement finished area (if multiple types)
[FR] Note du sous-sol (est-ce vivable?) type 2 si plusieurs types

Tableau

BsmtFinType2 Libelle Eff. train Eff. test % train % test
NA No Basement 38 42 2.60 2.88
ALQ Average Living Quarters 19 33 1.30 2.26
BLQ Below Average Living Quarters 33 35 2.26 2.40
GLQ Good Living Quarters 14 20 0.96 1.37
LwQ Low Quality 46 41 3.15 2.81
Rec Average Rec Room 54 51 3.70 3.50
Unf Unfinshed 1256 1237 86.03 84.78
Total Total 1460 1459 100.00 100.00

Graphiques



BsmtFinSF2 (Continuous)

Type 2 finished square feet
[FR] Surface finie (habitable?) du sous-sol de type 2

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 1474 0
test 0 0 0 0 1526 1

Graphiques

## `geom_smooth()` using method = 'gam'


BsmtUnfSF (Continuous)

Unfinished square feet of basement area
[FR] Surface non finie du sous-sol (non habitable?)

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 223.00 477.5 808.00 2336 0
test 0 219.25 460.0 797.75 2140 1

Graphiques

## `geom_smooth()` using method = 'gam'


TotalBsmtSF (Continuous)

Total square feet of basement area
[FR] Surface totale du sous-sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 795.75 991.5 1298.25 6110 0
test 0 784.00 988.0 1305.00 5095 1

Graphiques

## `geom_smooth()` using method = 'gam'


Heating (Nominal)

Type of heating
[FR] Type de chauffage

Tableau

Heating Libelle Eff. train Eff. test % train % test
Floor Floor Furnace 1 0 0.07 0.00
GasA Gas forced warm air furnace 1428 1446 97.81 99.11
GasW Gas hot water or steam heat 18 9 1.23 0.62
Grav Gravity furnace 7 2 0.48 0.14
OthW Hot water or steam heat other than gas 2 0 0.14 0.00
Wall Wall furnace 4 2 0.27 0.14
Total Total 1460 1459 100.00 100.01

Graphiques



HeatingQC (Ordinal)

Heating quality and condition
[FR] Qualité et état du chauffage

Tableau

HeatingQC Libelle Eff. train Eff. test % train % test
Ex Excellent 741 752 50.75 51.54
Fa Fair 49 43 3.36 2.95
Gd Good 241 233 16.51 15.97
Po Poor 1 2 0.07 0.14
TA Average/Typical 428 429 29.32 29.40
Total Total 1460 1459 100.01 100.00

Graphiques



CentralAir (Nominal)

Central air conditioning
[FR] Air conditionné

Tableau

CentralAir Libelle Eff. train Eff. test % train % test
N No 95 101 6.51 6.92
Y Yes 1365 1358 93.49 93.08
Total Total 1460 1459 100.00 100.00

Graphiques



Electrical (Ordinal)

Electrical system
[FR] Systeme electrique

Tableau

Electrical Libelle Eff. train Eff. test % train % test
NA NA 1 0 0.07 0.00
FuseA Fuse Box over 60 AMP and all Romex wiring (Average) 94 94 6.44 6.44
FuseF 60 AMP Fuse Box and mostly Romex wiring (Fair) 27 23 1.85 1.58
FuseP 60 AMP Fuse Box and mostly knob & tube wiring (poor) 3 5 0.21 0.34
Mix Mixed 1 0 0.07 0.00
SBrkr Standard Circuit Breakers & Romex 1334 1337 91.37 91.64
Total Total 1460 1459 100.01 100.00

Graphiques



1stFlrSF (Continuous)

First Floor square feet
[FR] Surface du premier etage

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 334 882.0 1087 1391.25 4692 0
test 407 873.5 1079 1382.50 5095 0

Graphiques

## `geom_smooth()` using method = 'gam'


2ndFlrSF (Continuous)

Second Floor square feet
[FR] Surface du second etage

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 728 2065 0
test 0 0 0 676 1862 0

Graphiques

## `geom_smooth()` using method = 'gam'


LowQualFinSF (Continuous)

Low quality finished square feet (all floors)
[FR] Surface globale ?

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 572 0
test 0 0 0 0 1064 0

Graphiques

## `geom_smooth()` using method = 'gam'


GrLivArea (Continuous)

Above grade (ground) living area square feet
[FR] Surface globale hormis sous-sol ?

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 334 1129.5 1464 1776.75 5642 0
test 407 1117.5 1432 1721.00 5095 0

Graphiques

## `geom_smooth()` using method = 'gam'


BsmtFullBath (Discrete)

Basement full bathrooms
[FR] Nombre de salle de bain complète au sous sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 1 3 0
test 0 0 0 1 3 2

Graphiques

## `geom_smooth()` using method = 'gam'


BsmtHalfBath (Discrete)

Basement half bathrooms
[FR] Nombre de salle de bain au sous sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 2 0
test 0 0 0 0 2 2

Graphiques

## `geom_smooth()` using method = 'gam'


FullBath (Discrete)

Full bathrooms above grade
[FR] Nombre de salle de bain complète sans compter le sous sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 1 2 2 3 0
test 0 1 2 2 4 0

Graphiques

## `geom_smooth()` using method = 'gam'


HalfBath (Discrete)

Half bathrooms above grade
[FR] Nombre de salle de bain sans compter le sous sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 1 2 0
test 0 0 0 1 2 0

Graphiques

## `geom_smooth()` using method = 'gam'


BedroomAbvGr (Discrete)

Bedrooms above grade (does NOT include basement bedrooms)
[FR] Nombre de chambre en dehors du sous sol

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 2 3 3 8 0
test 0 2 3 3 6 0

Graphiques

## `geom_smooth()` using method = 'gam'


KitchenAbvGr (Discrete)

Kitchens above grade
[FR] Nombre de cuisine

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 1 1 1 3 0
test 0 1 1 1 2 0

Graphiques

## `geom_smooth()` using method = 'gam'


KitchenQual (Ordinal)

Kitchen quality
[FR] Qualité cuisine

Tableau

KitchenQual Libelle Eff. train Eff. test % train % test
NA NA 0 1 0.00 0.07
Ex Excellent 100 105 6.85 7.20
Fa Fair 39 31 2.67 2.12
Gd Good 586 565 40.14 38.73
TA Average/Typical 735 757 50.34 51.88
Total Total 1460 1459 100.00 100.00

Graphiques



TotRmsAbvGrd (Discrete)

Total rooms above grade (does not include bathrooms)
[FR] Nombre de pièce hors sous-sol et salle de bain

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 2 5 6 7 14 0
test 3 5 6 7 15 0

Graphiques

## `geom_smooth()` using method = 'gam'


Functional (Ordinal)

Home functionality (Assume typical unless deductions are warranted)
[FR] ??

Tableau

Functional Libelle Eff. train Eff. test % train % test
NA NA 0 2 0.00 0.14
Maj1 Major Deductions 1 14 5 0.96 0.34
Maj2 Major Deductions 2 5 4 0.34 0.27
Min1 Minor Deductions 1 31 34 2.12 2.33
Min2 Minor Deductions 2 34 36 2.33 2.47
Mod Moderate Deductions 15 20 1.03 1.37
Sev Severely Damaged 1 1 0.07 0.07
Typ Typical Functionality 1360 1357 93.15 93.01
Total Total 1460 1459 100.00 100.00

Graphiques



Fireplaces (Discrete)

Number of fireplaces
[FR] Nombre de cheminée

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 1 1 3 0
test 0 0 0 1 4 0

Graphiques

## `geom_smooth()` using method = 'gam'


FireplaceQu (Ordinal)

Fireplace quality
[FR] Qualité cheminée

Tableau

FireplaceQu Libelle Eff. train Eff. test % train % test
NA No Fireplace 690 730 47.26 50.03
Ex Excellent - Exceptional Masonry Fireplace 24 19 1.64 1.30
Fa Fair - Prefabricated Fireplace in basement 33 41 2.26 2.81
Gd Good - Masonry Fireplace in main level 380 364 26.03 24.95
Po Poor - Ben Franklin Stove 20 26 1.37 1.78
TA Average - Prefabricated Fireplace in main living area or Masonry Fireplace in basement 313 279 21.44 19.12
Total Total 1460 1459 100.00 99.99

Graphiques



GarageType (Nominal)

Garage location
[FR] localisation du garage

Tableau

GarageType Libelle Eff. train Eff. test % train % test
NA No Garage 81 76 5.55 5.21
2Types More than one type of garage 6 17 0.41 1.17
Attchd Attached to home 870 853 59.59 58.46
Basment Basement Garage 19 17 1.30 1.17
BuiltIn Built-In (Garage part of house - typically has room above garage) 88 98 6.03 6.72
CarPort Car Port 9 6 0.62 0.41
Detchd Detached from home 387 392 26.51 26.87
Total Total 1460 1459 100.01 100.01

Graphiques



GarageYrBlt (Discrete)

Year garage was built
[FR] Année de construction du garage

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 1900 1961 1980 2002 2010 81
test 1895 1959 1979 2002 2207 78

Graphiques

## `geom_smooth()` using method = 'gam'


GarageFinish (Ordinal)

Interior finish of the garage
[FR] Finition intérieure du garage

Tableau

GarageFinish Libelle Eff. train Eff. test % train % test
NA No Garage 81 78 5.55 5.35
Fin Finished 352 367 24.11 25.15
RFn Rough Finished 422 389 28.90 26.66
Unf Unfinished 605 625 41.44 42.84
Total Total 1460 1459 100.00 100.00

Graphiques



GarageCars (Discrete)

Size of garage in car capacity
[FR] Taille du garage en capacité de nombre de voiture

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 1 2 2 4 0
test 0 1 2 2 5 1

Graphiques

## `geom_smooth()` using method = 'gam'


GarageArea (Continuous)

Size of garage in square feet
[FR] Taille du garage en pied carré

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 334.5 480 576 1418 0
test 0 318.0 480 576 1488 1

Graphiques

## `geom_smooth()` using method = 'gam'


GarageQual (Ordinal)

Garage quality
[FR] Qualité du garage

Tableau

GarageQual Libelle Eff. train Eff. test % train % test
NA No Garage 81 78 5.55 5.35
Ex Excellent 3 0 0.21 0.00
Fa Fair 48 76 3.29 5.21
Gd Good 14 10 0.96 0.69
Po Poor 3 2 0.21 0.14
TA Average/Typical 1311 1293 89.79 88.62
Total Total 1460 1459 100.01 100.01

Graphiques



GarageCond (Ordinal)

Garage condition
[FR] Etat du garage

Tableau

GarageCond Libelle Eff. train Eff. test % train % test
NA No Garage 81 78 5.55 5.35
Ex Excellent 2 1 0.14 0.07
Fa Fair 35 39 2.40 2.67
Gd Good 9 6 0.62 0.41
Po Poor 7 7 0.48 0.48
TA Average/Typical 1326 1328 90.82 91.02
Total Total 1460 1459 100.01 100.00

Graphiques



PavedDrive (Ordinal)

Paved driveway
[FR] Allée pavée

Tableau

PavedDrive Libelle Eff. train Eff. test % train % test
N Dirt/Gravel 90 126 6.16 8.64
P Partial Pavement 30 32 2.05 2.19
Y Paved 1340 1301 91.78 89.17
Total Total 1460 1459 99.99 100.00

Graphiques



WoodDeckSF (Continuous)

Wood deck area in square feet
[FR] Surface terasse en bois en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 168 857 0
test 0 0 0 168 1424 0

Graphiques

## `geom_smooth()` using method = 'gam'


OpenPorchSF (Continuous)

Open porch area in square feet
[FR] Surface du perron (ouvert) en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 25 68 547 0
test 0 0 28 72 742 0

Graphiques

## `geom_smooth()` using method = 'gam'


EnclosedPorch (Continuous)

Enclosed porch area in square feet
[FR] Surface du perron (fermé, veranda?) en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 552 0
test 0 0 0 0 1012 0

Graphiques

## `geom_smooth()` using method = 'gam'


3SsnPorch (Continuous)

Three season porch area in square feet
[FR] Surface veranda en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 508 0
test 0 0 0 0 360 0

Graphiques

## `geom_smooth()` using method = 'gam'


ScreenPorch (Continuous)

Screen porch area in square feet
[FR] Surface veranda (avec moustiquaire) en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 480 0
test 0 0 0 0 576 0

Graphiques

## `geom_smooth()` using method = 'gam'


PoolArea (Continuous)

Pool area in square feet
[FR] Surface de la piscine en pied carre

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 738 0
test 0 0 0 0 800 0

Graphiques

## `geom_smooth()` using method = 'gam'


PoolQC (Ordinal)

Pool quality
[FR] Qualité de la piscine

Tableau

PoolQC Libelle Eff. train Eff. test % train % test
NA No Pool 1453 1456 99.52 99.79
Ex Excellent 2 2 0.14 0.14
Fa Fair 2 0 0.14 0.00
Gd Good 3 1 0.21 0.07
Total Total 1460 1459 100.01 100.00

Graphiques



Fence (Ordinal)

Fence quality
[FR] Qualité clôture

Tableau

Fence Libelle Eff. train Eff. test % train % test
NA No Fence 1179 1169 80.75 80.12
GdPrv Good Privacy 59 59 4.04 4.04
GdWo Good Wood 54 58 3.70 3.98
MnPrv Minimum Privacy 157 172 10.75 11.79
MnWw Minimum Wood/Wire 11 1 0.75 0.07
Total Total 1460 1459 99.99 100.00

Graphiques



MiscFeature (Nominal)

Miscellaneous feature not covered in other categories
[FR] Autres propriétés non mentionné par les autres variables

Tableau

MiscFeature Libelle Eff. train Eff. test % train % test
NA None 1406 1408 96.30 96.50
Gar2 2nd Garage (if not described in garage section) 2 3 0.14 0.21
Othr Other 2 2 0.14 0.14
Shed Shed (over 100 SF) 49 46 3.36 3.15
TenC Tennis Court 1 0 0.07 0.00
Total Total 1460 1459 100.01 100.00

Graphiques



MiscVal (Continuous)

$Value of miscellaneous feature
[FR] Valeur monétaire des propriétés non mentionnées

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 0 0 0 0 15500 0
test 0 0 0 0 17000 0

Graphiques

## `geom_smooth()` using method = 'gam'


MoSold (Discrete)

Month Sold (MM)
[FR] Mois de vente

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 1 5 6 8 12 0
test 1 4 6 8 12 0

Graphiques

## `geom_smooth()` using method = 'gam'


YrSold (Discrete)

Year Sold (YYYY)
[FR] Année de vente

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 2006 2007 2008 2009 2010 0
test 2006 2007 2008 2009 2010 0

Graphiques

## `geom_smooth()` using method = 'gam'


SaleType (Nominal)

Type of sale
[FR] Type de vente

Tableau

SaleType Libelle Eff. train Eff. test % train % test
NA NA 0 1 0.00 0.07
COD Court Officer Deed/Estate 43 44 2.95 3.02
CWD Warranty Deed - Cash 4 8 0.27 0.55
Con Contract 15% Down payment regular terms 2 3 0.14 0.21
ConLD Contract Low Down 9 17 0.62 1.17
ConLI Contract Low Interest 5 4 0.34 0.27
ConLw Contract Low Down payment and low interest 5 3 0.34 0.21
New Home just constructed and sold 122 117 8.36 8.02
Oth Other 3 4 0.21 0.27
WD Warranty Deed - Conventional 1267 1258 86.78 86.22
Total Total 1460 1459 100.01 100.01

Graphiques



SaleCondition (Nominal)

Condition of sale
[FR] Condition de vente

Tableau

SaleCondition Libelle Eff. train Eff. test % train % test
Abnorml Abnormal Sale - trade 101 89 6.92 6.10
AdjLand Adjoining Land Purchase 4 8 0.27 0.55
Alloca Allocation - two linked properties with separate deeds 12 12 0.82 0.82
Family Sale between family members 20 26 1.37 1.78
Normal Normal Sale 1198 1204 82.05 82.52
Partial Home was not completed when last assessed (associated with New Homes) 125 120 8.56 8.22
Total Total 1460 1459 99.99 99.99

Graphiques



SalePrice (Continuous)

Sale price
[FR] Prix de vente

Tableau

set Min. 1st Qu. Median 3rd Qu. Max. NA’s
train 34900 129975 163000 214000 755000 0
test Inf NA NA NA -Inf 1459

Graphiques

## `geom_smooth()` using method = 'gam'